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DETAILED ACTION 

Response to Amendment 

1 . In response to the Office Action mailed 12/10/08, applicant has submitted 
an amendment filed 2/26/09. 

Claims 29-30, 32-34, 46, and 47 have been amended. Claims 26-28, 31 , 
35-45, and 48-58, have been cancelled. Neew Claims 59 and 60 have been 
added. 



Response to Arguments 

1 . Applicant's arguments with respect to claims 29-30, 32-34, 46-47, have 
been considered but are moot in view of the new ground(s) of rejection. 

Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for 
all obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described 
as set forth in section 1 02 of this title, if the differences between the subject matter sought to 
be patented and the prior art are such that the subject matter as a whole would have been 
obvious at the time the invention was made to a person having ordinary skill in the art to which 
said subject matter pertains. Patentability shall not be negatived by the manner in which the 
invention was made. 

3. Claims 29-30, 32-34, 46-47, are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Chang (US 2003/0171931), in view of Nguyen et al. (US 
6,263,309), hereafter Nguyen, and Rigazio et al. (US 6,526,379), hereafter 
Rigazio. 
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As per Claim 29, Chang teaches a standard model creating apparatus for 
creating a standard model which shows an acoustic characteristic having a 
specific attribute and is used for speech recognition in an electronic apparatus 
used by a user, the standard model creating apparatus comprising: ("recognition 
model... acoustic model... customized to a user", paragraph 1; "frequencies", 
paragraph 67), the standard model creating apparatus comprising: 

a reference model storing unit configured to store a plurality of reference 
models which are models showing an acoustic characteristic having a specific 
attribute ("plurality of different cohort models", paragraph 40; "enrollment data", 
paragraph 39; "selects the speakers... closest to enrollment data", paragraph 42; 
"plurality of different cohort models", paragraph 40; "enrollment data", paragraph 
39; "selects the speakers... closest to enrollment data", paragraph 42; "top N 
possible cohort models", paragraph 51; where the speaker's voice qualities are 
attributes that are targeted by speaker dependent recognizers) 

a standard model creating unit configured to create the standard model by 
calculating parameters of the standard model using parameters of the plurality of 
reference models stored in said reference model storing unit ("parameters for 
possible cohorts are generated", paragraph 53; "modifies the parameters... using 
the parameters in the... cohort models", paragraph 57) 

wherein said standard model creating unit includes: a standard model 
structure determining unit operable to determine a structure of the standard 
model which is to be created ("parameters for possible cohorts are generated", 
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paragraph 53; "modifies the parameters... using the parameters in the... cohort 
models", paragraph 57) 

an initial standard model creating unit operable to determine initial values 
of the parameters specifying the standard model whose structure has been 
determined ("parameters for possible cohorts are generated", paragraph 53; 
"modifies the parameters... using the parameters in the... cohort models", 
paragraph 57) 

a parameter estimating unit configured to estimate and calculate the 
parameters of the standard model so as to maximize or locally maximize a 
probability or likelihood of the standard model, whose initial values have been 
determined, with respect to the plurality of reference models ("parameters for 
possible cohorts are generated", paragraph 53; "modifies the parameters... using 
the parameters in the... cohort models", paragraph 57; "likelihood", paragraph 55) 

Chang fails to teach the standard model creating apparatus using a 
probability model that expresses a frequency parameter showing an acoustic 
characteristic as an output probability, where the reference models are 
probability models, where the parameters are statistics, wherein the plurality of 
reference models and the standard model are expressed using at least one 
Gaussian distribution, and said standard model structure determining unit is 
operable to determine at least a number of Gaussian mixture distributions as the 
structure of the standard model. 

Nguyen teaches the standard model creating apparatus using a probability 
model that expresses a frequency parameter showing an acoustic characteristic 



Application/Control Number: 10/534,869 Page 5 

Art Unit: 2626 

as an output probability, where the reference models are probability models, 
where the parameters are statistics, wherein the plurality of reference models 
and the standard model are expressed using at least one Gaussian distribution, 
and said standard model structure determining unit is operable to determine at 
least a number of Gaussian mixture distributions as the structure of the standard 
model ("training speakers... speaker dependent [SD] models", col. 4, lines 38-52; 
"supervector for each speaker comprises an ordered list of parameters... 
corresponding to at least a portion of the parameters of the Hidden Markov 
models", col. 4, lines 53-64; "new speaker... compute statistics... each sound 
unit", col. 5, lines 42-56; "HMM... observable outputs... transition probabilities", 
col. 3, lines 17-43; "Gaussian distributions... probability distribution... Gaussian 
function... parameter-based speech modeling", col. 4, lines 3-36; "Alabama 
female accent", col. 7, lines 5-13; where parameters generally include frequency 
parameters when analyzing speech [speech is acoustic]) 

wherein the plurality of reference models and the standard model are 
expressed using at least one Gaussian distribution, and said standard model 
structure determining unit is operable to determine at least a number of Gaussian 
mixture distributions as the structure of the standard model ("training speakers... 
speaker dependent [SD] models", col. 4, lines 38-52; "supervector for each 
speaker comprises an ordered list of parameters... corresponding to at least a 
portion of the parameters of the Hidden Markov models", col. 4, lines 53-64; "new 
speaker... compute statistics... each sound unit", col. 5, lines 42-56; "HMM... 
observable outputs... transition probabilities", col. 3, lines 17-43; "Gaussian 
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distributions... probability distribution... Gaussian function... parameter-based 
speech modeling", col. 4, lines 3-36; "Alabama female accent", col. 7, lines 5-13; 
where Nguyen teaches the use of Gaussians in hidden markov models and so 
suggests where the Gaussians are what are adapted in a model) 

Therefore, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Chang to include the teaching of Nguyen of the 
standard model creating apparatus using a probability model that expresses a 
frequency parameter showing an acoustic characteristic as an output probability, 
where the reference models are probability models, where the parameters are 
statistics, wherein the plurality of reference models and the standard model are 
expressed using at least one Gaussian distribution, and said standard model 
structure determining unit is operable to determine at least a number of Gaussian 
mixture distributions as the structure of the standard model, in order to facilitate 
quick speaker adaptation of models, as described by Nguyen (col. 1, lines 7-10; 
col. 1, lines 60-66). 

Chang, in view of Nguyen, fail to teach the structure of the standard model 
is created based on specification information regarding specifications of the 
electronic apparatus. 

Rigazio suggests the structure of the standard model is created based on 
specification information regarding specifications of the electronic apparatus 
("different sets of Gaussians... recognizers with more sophisticated... large 
amount of memory... embedded systems and speech-enabled consumer 
products... do not have much memory or processing power to spare... reducing 
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the number of Gaussians", col. 1 , lines 17-49; where Rigazio suggests tailoring 
the size [and therefore, the structure] of a model because certain devices cannot 
handle recognition models with large quantities of data). 

Therefore, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Chang, in view of Nguyen, to include the teaching 
of Rigazio of the structure of the standard model is created based on 
specification information regarding specifications of the electronic apparatus, in 
order to not overburden certain devices, as described by Rigazio (col. 1 , lines 38- 
49). 

As per Claim 30, Chang, in view of Nguyen, fail to teach wherein the 
specification information indicates at least one of a type of an application 
program running on the electronic apparatus and specifications of the electronic 
apparatus. 

Rigazio wherein the specification information indicates at least one of a 
type of an application program running on the electronic apparatus and 
specifications of the electronic apparatus ("different sets of Gaussians... 
recognizers with more sophisticated... large amount of memory... embedded 
systems and speech-enabled consumer products... do not have much memory 
or processing power to spare... reducing the number of Gaussians", col. 1, lines 
17-49; where Rigazio suggests tailoring the size [and therefore, the structure] of 
a model because certain devices cannot handle recognition models with large 
quantities of data). 
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Therefore, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Chang, in view of Nguyen, to include the teaching 
of Rigazio of wherein the specification information indicates at least one of a type 
of an application program running on the electronic apparatus and specifications 
of the electronic apparatus, in order to not overburden certain devices, as 
described by Rigazio (col. 1 , lines 38-49). 

As per Claim 32, Chang suggests a specification information holding unit 
configured to store an application/specifications correspondence database 
showing a correspondence between an application program which uses the 
standard model and specifications of the standard model, wherein said standard 
model structure determining unit is configured to read specifications 
corresponding to an application program to be activated from the 
application/specifications correspondence database held by said specification 
information holding unit, and to determine the structure of the standard model 
based on the read specifications ("recognition model... acoustic model... 
customized to a user", paragraph 1 ; "parameters for possible cohorts are 
generated", paragraph 53; "modifies the parameters... using the parameters in 
the... cohort models", paragraph 57; where it is known that computers have 
multiple users, and so it is obvious to store some sort of correspondence 
between the different users and their respective models, and to use the correct 
model when a corresponding user desires to use the recognition system). 
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As per Claim 33, Chang teaches a specification information creating unit 
configured to create the specification information, wherein said model structure 
determining unit is configured to determine the structure of the standard model 
based on the created specification information ("recognition model... acoustic 
model... customized to a user", paragraph 1; where knowledge of the user who 
the model is to be adapted for is obvious to indicate to the system). 

As per Claim 34, Chang suggests wherein the standard model creating 
apparatus is connected to a terminal apparatus via a communication channel, 
and further comprises: an information receiving unit configured to receive the 
information from the terminal apparatus, wherein said standard model structure 
determining unit is configured to determine the structure of the standard model 
based on the received information ("input speech samples", paragraph 39; 
"remote computer", paragraph 34; "recognition model... acoustic model... 
customized to a user", paragraph 1; "enrollment data", paragraph 39; "selects the 
speakers... closest to enrollment data", paragraph 42; where the input speech 
receiving device can be a terminal) 

Chang, in view of Nguyen, fail to where the information is specification 
information. 

Rigazio where the information is specification information ("different sets 
of Gaussians... recognizers with more sophisticated... large amount of 
memory... embedded systems and speech-enabled consumer products... do not 
have much memory or processing power to spare... reducing the number of 
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Gaussians", col. 1, lines 17-49; where Rigazio suggests tailoring the size [and 
therefore, the structure] of a model because certain devices cannot handle 
recognition models with large quantities of data). 

Therefore, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Chang, in view of Nguyen, to include the teaching 
of Rigazio of where the information is specification information, in order to not 
overburden certain devices, as described by Rigazio (col. 1, lines 38-49). 

As per Claim 46-47, their limitations are similar to those in Claim 29, and 
so are rejected under similar rationale. 

2. Claim 59 is rejected under 35 U.S.C. 103(a) as being unpatentable over 
Chang, in view of Nguyen and Rigazio, as applied to Claim 33, above, and 
further in view of Leggetter et al. (US 7,191 ,130), hereafter Leggetter. 

As per Claim 59, Chang, in view of Nguyen, fail to teach wherein said 
specification information unit is configured to create the specification information 
with an N1 number of the Gaussian distributions, and to create the specification 
information with an N2 (>N1) number of the Gaussian distributions, and said 
standard model structure determining unit is configured to determine the number 
of the Gaussian mixture distributions according to the specification information 
created by said specification information creating unit. 
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Rigazio wherein said specification information unit is configured to create 
the specification information with an N1 number of the Gaussian distributions, 
and to create the specification information with an N2 (>N1) number of the 
Gaussian distributions, and said standard model structure determining unit is 
configured to determine the number of the Gaussian mixture distributions 
according to the specification information created by said specification 
information creating unit ("different sets of Gaussians... recognizers with more 
sophisticated... large amount of memory... embedded systems and speech- 
enabled consumer products... do not have much memory or processing power to 
spare... reducing the number of Gaussians", col. 1, lines 17-49; where Rigazio 
suggests tailoring the size [and therefore, the structure] of a model because 
certain devices cannot handle recognition models with large quantities of data). 

Therefore, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Chang, in view of Nguyen, to include the teaching 
of Rigazio of wherein said specification information unit is configured to create 
the specification information with an N1 number of the Gaussian distributions, 
and to create the specification information with an N2 (>N1) number of the 
Gaussian distributions, and said standard model structure determining unit is 
configured to determine the number of the Gaussian mixture distributions 
according to the specification information created by said specification 
information creating unit, in order to not overburden certain devices, as described 
by Rigazio (col. 1, lines 38-49). 
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Chang, in view of Nguyen and Rigazio, fail to teach where less gaussians 
are used when an instruction that the electronic apparatus is to perform quick 
speech recognition is obtained from the user, and more Gaussians are used 
when an instruction that the electronic apparatus is to perform precise speech 
recognition is obtained from the user. 

Leggetter suggests where less gaussians are used when an instruction 
that the electronic apparatus is to perform quick speech recognition is obtained 
from the user, and more Gaussians are used when an instruction that the 
electronic apparatus is to perform precise speech recognition is obtained from 
the user ("speech recognition systems... memory, speed, and accuracy can be 
traded off... less CPU resources... cost of lower average... accuracy", col. 1, 
lines 29-46; "configuration specifies a speech/accuracy... performance level... 
recognition is very fast, but less accurate... slow, and most accurate... frustrated 
users", col. 5, lines 13-36; where Leggetter suggests balancing based on what is 
needed [accuracy/speed], and also teaches where the preferred balance is 
determined based on caller/user expectations, and so suggests where the 
performance level trade-off is determined by a user). 

Therefore, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Chang, in view of Nguyen and Rigazio, to include 
the teaching of Leggetter of where less gaussians are used when an instruction 
that the electronic apparatus is to perform quick speech recognition is obtained 
from the user, and more Gaussians are used when an instruction that the 
electronic apparatus is to perform precise speech recognition is obtained from 
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the user, in order to ensure that users are not frustrated by the system, as 
described by Leggetter (col. 5, lines 24-36). 

Allowable Subject Matter 

3. The following is a statement of reasons for the indication of allowable 
subject matter: 

The prior art of record does not teach the equations in Claim 60. 
Conclusion 

4. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. See PTO-892. 

5. 

6. Applicant's amendment necessitated the new ground(s) of rejection 
presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. 
See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as 
set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire 
THREE MONTHS from the mailing date of this action. In the event a first reply is 
filed within TWO MONTHS of the mailing date of this final action and the advisory 
action is not mailed until after the end of the THREE-MONTH shortened statutory 
period, then the shortened statutory period will expire on the date the advisory 
action is mailed, and any extension fee pursuant to 37 CFR 1.136(a) will be 
calculated from the mailing date of the advisory action. In no event, however, will 



Application/Control Number: 10/534,869 
Art Unit: 2626 

the statutory period for reply expire later 
final action. 
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